SQUINKY! A Corpus of Sentence-level Formality, Informativeness, and Implicature

Author

  • Shibamouli Lahiri
Abstract

We introduce a corpus of 7,032 sentences rated by human annotators for formality, informativeness, and implicature on a 1-7 scale. The corpus was annotated using Amazon Mechanical Turk. Reliability of the obtained judgments was examined by comparing mean ratings across two MTurk experiments and by correlating them with pilot annotations (on sentence formality) conducted in a more controlled setting. Despite the subjectivity and inherent difficulty of the annotation task, correlations between mean ratings were quite encouraging, especially on formality and informativeness. We further explored correlation between the three linguistic variables, genre-wise variation of ratings and correlations within genres, compatibility with automatic stylistic scoring, and the sentential make-up of a document in terms of style. To date, our corpus is the largest sentence-level annotated corpus released for formality, informativeness, and implicature.
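The reliability check described in the abstract (comparing mean ratings across two annotation experiments) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the dictionary layout, and the toy ratings below are all assumptions made for illustration, and Pearson correlation is used here only as one plausible choice of correlation measure.

```python
from math import sqrt
from statistics import mean


def mean_ratings(ratings_by_sentence):
    """Average the 1-7 ratings each sentence received (hypothetical layout:
    sentence id -> list of individual annotator ratings)."""
    return {sid: mean(r) for sid, r in ratings_by_sentence.items()}


def pearson(xs, ys):
    """Plain Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def correlation_between_experiments(exp_a, exp_b):
    """Correlate per-sentence mean ratings over sentences rated in both runs."""
    means_a = mean_ratings(exp_a)
    means_b = mean_ratings(exp_b)
    shared = sorted(set(means_a) & set(means_b))
    return pearson([means_a[s] for s in shared], [means_b[s] for s in shared])


# Toy, made-up ratings for three sentences in two hypothetical experiments.
exp_a = {"s1": [6, 7, 6], "s2": [2, 3, 2], "s3": [4, 4, 5]}
exp_b = {"s1": [7, 6, 6], "s2": [1, 2, 3], "s3": [4, 5, 4]}
r = correlation_between_experiments(exp_a, exp_b)
print(f"correlation of mean ratings: {r:.3f}")
```

A value near 1 would indicate that the two experiments rank sentences similarly on the rated dimension; the same function could be applied separately to formality, informativeness, and implicature ratings.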

Similar resources

Pragmatic tolerance: Implications for the acquisition of informativeness and implicature

Recent investigations of the acquisition of scalar implicature report that young children do not reliably reject a sentence with a weak scalar term, e.g. 'some of the books are red', when it is used as a description of a situation where a stronger statement is true, e.g. where all the books are red. This is taken as evidence that children do not interpret the sentence with the implicature that ...

Comedy, Context and Unsaid Meaning: A Case Study in Conversational Implicature

Pragmatics moves away from the word level and sentence level study of language towards the study of language in real-world context and at discourse level whereby two or more participants take part in conversation. There are moments when the speaker explicitly says something but the listener may have other interpretations and inferences from their statements. The aim of this study was to demonst...

Resolving Quantity- and Informativeness-Implicature in Indefinite Reference

A central challenge for all theories of conversational implicature (Grice, 1957, 1975) is characterizing the fundamental tension between Quantity (Q) implicature, in which utterance meaning is refined through exclusion of the meanings of alternative utterances, and Informativeness (I) implicature, in which utterance meaning is refined by strengthening to the prototypical case (Atlas & Levinson...

Inter-rater Agreement on Sentence Formality

Formality is one of the most important dimensions of writing style variation. In this study we conducted an inter-rater reliability experiment for assessing sentence formality on a five-point Likert scale, and obtained good agreement results as well as different rating distributions for different sentence categories. We also performed a difficulty analysis to identify the bottlenecks of our rat...

Optimizing Informativeness and Readability for Sentiment Summarization

We propose a novel algorithm for sentiment summarization that takes account of informativeness and readability, simultaneously. Our algorithm generates a summary by selecting and ordering sentences taken from multiple review texts according to two scores that represent the informativeness and readability of the sentence order. The informativeness score is defined by the number of sentiment expr...

Journal:
  • CoRR

Volume abs/1506.02306

Publication date: 2015